Clotho: Decoupling page layout from storage organization
نویسندگان
چکیده
As database application performance depends on the utilization of the disk and memory hierarchy, and the speed gap between the processor and memory components widens, smart data placement plays a central role in increasing locality and in improving memory utilization. Existing techniques, however, do not optimize accesses to all levels of memory hierarchy and for all the different workloads, because each storage level uses different technology (cache, memory, disks) and each application accesses data using different (often conflicting) patterns. This paper introduces Clotho, a new buffer pool and storage management architecture. Clotho decouples in-memory page layout from data organization on non-volatile storage devices, enabling independent data layout design at each level of the storage hierarchy. Using Clotho, a DBMS can maximize cache and memory utilization by (a) transparently using appropriate data layouts on memory and non-volatile storage, and (b) dynamically synthesizing data pages to follow application access patterns at each level as needed. Clotho enables (a) independently-tailored page layouts for dynamically changing as well as compound workloads, and (b) use of alternative technologies at each level (e.g., disk arrays or MEMS-based storage devices). We describe the Clotho design and implementation using disk array logical volumes and simulated MEMS-based storage devices, and we evaluate performance under a variety of workloads.
منابع مشابه
Clotho: Decoupling memory page layout from storage organization
As database application performance depends on the utilization of the memory hierarchy, smart data placement plays a central role in increasing locality and in improving memory utilization. Existing techniques, however, do not optimize accesses to all levels of the memory hierarchy and for all the different workloads, because each storage level uses different technology (cache, memory, disks) a...
متن کاملThesis Proposal Efficient data organization and management on heterogeneous storage hierarchies
As a central part of database systems, data organization has a direct impact on functionality and performance of data management applications. Current data organization, based on the conventional linear abstract of storage devices, linearizes multidimensional data along a preselected dimension when storing them to disks. Therefore existing data organizations have inherent performance tradeoffs ...
متن کاملEfficient data organization and management on heterogeneous storage hierarchies
Due to preferences for design and implementation simplicity, current data organization and management in database systems are based on simple assumptions about storage devices and workload characteristics. This has been the major design principle since the inception of database systems. While the deviceand workloadoblivious approach worked well in the past, it falls short when considering today...
متن کاملClotho: Transparent Data Versioning at the Block I/O Level
Recently storage management has emerged as one of the main problems in building cost effective storage infrastructures. One of the issues that contribute to management complexity of storage systems is maintaining previous versions of data. Up till now such functionality has been implemented by high-level applications or at the filesystem level. However, many modern systems aim at higher scalabi...
متن کاملAdapting the Data Organization of Secondary Storage in Virtualized Environments
In virtualized environments, multiple virtual machines (VMs) usually share a common secondary storage system which is, thus, often subject to a broad range of access patterns and different requirements (e.g., regarding performance, capacity, and reliability) imposed by diverse applications running inside the VMs. Moreover, with applications and VMs being added, started, stopped, and removed, ac...
متن کامل